Data Lake Development with Big Data by Pradeep Pasupuleti & Beulah Salome Purra
Author:Pradeep Pasupuleti & Beulah Salome Purra [Pasupuleti, Pradeep]
Language: eng
Format: azw3
Publisher: Packt Publishing
Published: 2015-11-26T05:00:00+00:00
Addressing the limitations using Data Lake
Data Lake addresses these constraints by providing the capability to follow a write-once-run-anywhere development paradigm. This paradigm ensures that you design, code, and test your Integration data flow only once. It abstracts the underlying hardware configuration details from the development process. Once the Integration data flow has been deployed, it can be seamlessly ported onto a grid compute environment with any number of nodes. The integration data flow doesn't have to be recompiled or reconfigured in cases where the compute environment is scaled up or down due to the demand that the data places.
This approach ensures that the Data Lake scores better in terms of overall Data Integration process execution time, as all the hardware resources are effectively utilized to crunch data. This approach also draws a clear boundary that demarcates hardware configuration's effect on the ability to run code, without recompiling or reconfiguring for every change in the hardware. Hence, this fundamental liberty to literally write-once-run-anywhere gives Data Lake the ability to provide scalability on-demand seamlessly.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Implementing Enterprise Observability for Success by Manisha Agrawal and Karun Krishnannair(7419)
Supercharging Productivity with Trello by Brittany Joiner(6679)
Mastering Tableau 2023 - Fourth Edition by Marleen Meier(6448)
Secrets of the JavaScript Ninja by John Resig Bear Bibeault(6424)
Inkscape by Example by István Szép(6297)
Visualize Complex Processes with Microsoft Visio by David J Parker & Šenaj Lelić(5992)
Build Stunning Real-time VFX with Unreal Engine 5 by Hrishikesh Andurlekar(4998)
Design Made Easy with Inkscape by Christopher Rogers(4646)
Customizing Microsoft Teams by Gopi Kondameda(4184)
Linux Device Driver Development Cookbook by Rodolfo Giometti(3940)
Business Intelligence Career Master Plan by Eduardo Chavez & Danny Moncada(3781)
Extending Microsoft Power Apps with Power Apps Component Framework by Danish Naglekar(3773)
Salesforce Platform Enterprise Architecture - Fourth Edition by Andrew Fawcett(3652)
Pandas Cookbook by Theodore Petrou(3627)
The Tableau Workshop by Sumit Gupta Sylvester Pinto Shweta Sankhe-Savale JC Gillet and Kenneth Michael Cherven(3426)
TCP IP by Todd Lammle(2994)
Drawing Shortcuts: Developing Quick Drawing Skills Using Today's Technology by Leggitt Jim(2924)
Exploring Microsoft Excel's Hidden Treasures by David Ringstrom(2895)
Applied Predictive Modeling by Max Kuhn & Kjell Johnson(2884)
